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METHODS 



This invention relates to polymorphisms in the human pyruvate dehydrogenase 
5 complex E2 (PDH E2 or PDC E2) gene and corresponding novel allelic polypeptides encoded 
thereby. The invention also relates to methods and materials for analysing allelic variation in 
the PDH E2 gene, and to the use of PDH E2 polymorphism in the diagnosis and treatment of 
diseases in which modulation of pyruvate dehydrogenase activity could be of therapeutic 
benefit, such as diabetes, asthma, obesity, sepsis and peripheral vascular disease. 

1 0 The production of energy for biosynthesis of complex molecules and for muscle 

contraction is mediated by the hydrolysis of high energy phosphate bonds v^ithin adenosine 
triphosphate (ATP). In oxidative metabolism ATP is generated from acetyl coenzyme A 
(acetyl Co A), which itself is produced by the beta-oxidation of fatty acids, or as a resuU of the 
metabolism of glucose via the glycolytic pathway. The key regulatory enzyme which controls 

1 5 the rate of acetyl Co A formation from glucose is pyruvate dehydrogenase (PDH), which 
catalyses the oxidation of pyruvate to acetyl CoA and carbon dioxide with concomitant 
reduction of NAD to NADH. 

PDH is a multienzyme complex located in the mitochondrial matrix, containing 
multiple copies of three enzyme components required to complete the conversion of pyruvate 

20 to acetyl CoA (Patel and Roche 1990; FASEB J., 4: 3224-3233). El (pyruvate decarboxylase, 
E.C. 1.2.4.1) catalyses the non-reversible removal of carbon dioxide from pyruvate; E2 
(dihydrolipoamide acetyltransferase, E.C. 2.3.1.12) catalyses the formation of acetyl CoA; 
and E3 (dihydrolipoamide dehydrogenase, E.C. 1.8.1.4) reduces NAD to NADH. The El 
enzyme is a heterotetramer composed of two a and two P subimits. Decarboxylation of 

25 pyruvate, catalysed by El is the rate limiting step in the overall activity of the PDH complex. 
This step is also the target for a cycle of phosphorylation and dephosphorylation which forms 
one of the main mechanisms for regulating PDH activity. Two additional enzyme activities 
are also associated with the PDH complex: a specific kinase (PDK) which is capable of 
phosphorylating El a at three serine residues, and a loosely-associated specific phosphatase 

30 which reverses the phosphorylation. Phosphorylation of only one of the three serine residues 
on Ela renders El inactive. Removal of the phosphate groups by the specific phosphatase 
restores activity. Thus, the proportion of PDH in its active (dephosphorylated) state is 
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determined by a balance between the activity of the kinase and phosphatase. The activity of 
the kinase may be regulated in vivo by the relative concentrations of metabolic substrates such 
as NAD/NADH, CoA/acetylCoA and ADP/ATP as well as by the availability of pyruvate 
itself, therefore providing highly regulated, responsive control of PDH activity. 
5 Genetic abnormalities in the PDH complex are the most common cause of primary 

lactic acidosis in humans. The majority of cases have been linked with a defect in the Ela 
subunit. Pathologies associated with defects in the PDH complex conform to a broad clinical 
spectrum ranging from fatal lactic acidosis in the newborn, to a range of chronic 
neurodegenerative conditions with gross structural abnormalities in the central nervous 

1 0 system. Ela deficiency is an X-linked disorder which manifests different pattems of clinical 
presentation between males and females. In addition, heterozygous females show a wide 
variation in clinical severity of the disease, due largely to variations in the pattern of X- 
inactivation and differential effects of specific gene mutations on the expression, stability and 
activity of the mutant protein. A number of mutations in the PDH gene which lead to 

1 5 pyruvate dehydrogenase deficiency have been documented (for a review see NIH OMIM 
database, reference 312170). 

In disease states such as both non-insulin dependent (NIDDM) and insulin-dependent 
diabetes (IDDM), oxidation of lipids is increased with a concomitant reduction in utilisation 
of glucose, contributing to the hyperglycaemia. The activity of PDH is reduced in both 

20 insulin-dependent and non insulin-dependent diabetes. A further consequence of reduced PDH 
activity is an increase in pyruvate concentration resulting in increased availability of lactate as 
a substrate for hepatic gluconeogenesis. Diabetes is further exacerbated by impaired insulin 
secretion, which has been shown to be associated with reduced PDH activity in pancreatic (3- 
cells. It is beheved that increasing the activity of PDH may increase the rate of glucose 

25 oxidation and hence overall glucose utiUsation, in addition to reducing hepatic glucose output. 
Oxidation of glucose is capable of yieldmg more molecules of ATP per mole of 
oxygen than is oxidation of fatty acids, therefore in conditions where energy demand may 
exceed energy supply, such as myocardial ischaemia and reperfusion, intermittent 
claudication, cerebral ischaemia and reperfusion, shifting the balance of substrate utilisation in 

30 favour of glucose metaboUsm may be expected to improve the ability to maintain ATP levels 
and hence function. Activation of PDH is predicted to have this effect. 
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An agent which is capable of activating PDH is expected to be of benefit in treating 
conditions where an excess of circulating lactic acid is manifest such as in certain cases of 
sepsis. 

The agent dichloroacetic acid which increases the activity of PDH after acute 
5 administration in animals (Vary et al., 1988; Circ. Shock, 24: 3-18) has been shown to have 
the predicted effects in reducing glycaemia (Stacpoole et al, 1978 N. Engl. J. Med. 228, 526- 
530) and as a therapy for myocardial ischaemia (Bersin and Stacpoole 1997; American Heart 
Joumal, 124: 841-855) and lactic acidaemia (Stacpoole et al, 1983 N. Engl. J. Med 202, 390- 
396). 

1 0 cDNA sequences encoding PDH E2 have been submitted to public databases under the 

following accession numbers:Y00978, J03866. We believe that sequence J03866 contains a 
number of errors, discussed in more detail in Example 2. All positions in the human PDH E2 
gene herein refer to the positions in SEQ ID NO: 1 (which is equivalent to EMBL accession 
number Y00978 at the time of filing this appUcation) imless stated otherwise or apparent from 

1 5 the context. All positions in the human PDH E2 protein herein refer to the positions in SEQ 
ID NO: 2 unless stated otherwise or apparent from the context. 

DNA polymorphisms may lead to variations in amino acid sequence and consequently 
to altered protein structure and functional activity. Polymorphisms may also affect mRNA 
synthesis, maturation, transportation and stability. Polymorphisms which do not result in 

20 amino acid changes (silent polymorphisms) or which do not alter any known consensus 

sequences may nevertheless have a biological effect, for example by altering mRNA folding 
or stability. 

Knowledge of polymorphisms may be used to help identify patients most suited to 
therapy with particular pharmaceutical agents (this is often termed "pharmacogenetics") , 

25 Pharmacogenetics can also be used in pharmaceutical research to assist the dmg selection 
process. Polymorphisms may be used in mapping the human genome and to elucidate the 
genetic component of diseases. The reader is directed to the following references for 
background details on pharmacogenetics and other uses of polymorphism detection: Linder et 
al (1997), Clinical Chemistry, 43, 254; Marshall (1997), Nature Biotechnology, 11, 1249; 

30 International Patent Application WO 97/40462, Spectra Biomedical; and Schafer et al (1998), 
Nature Biotechnology, 1^, 33. 



i 
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Clinical trials have shown that patient response to treatment with pharmaceuticals is 
often heterogeneous. Thus there is a need for improved approaches to pharmaceutical agent 
design and therapy. 

Variations in polypeptide sequence will be referred to as follows: original amino acid 
5 (using one or three letter nomenclature), position, new amino acid. For (a hypothetical) 
example "D25K" or "Asp25Lys" means that at position 25 an aspartic acid (D) has been 
changed to lysine (K). Multiple variations in one polypeptide will be shown between square 
brackets with individual variations separated by commas. 

The present invention is based on the discovery of two single nucleotide 
1 0 polymorphisms (SNPs) in the coding region of the human PDH E2 gene. 

According to one aspect of the present invention there is provided a method for the 
diagnosis of a polymorphism in PDH E2 in a human, which method comprises determining 
the sequence of the human at one or more of positions 857 and 1255 in the PDH E2 gene as 
defined by the positions in SEQ ID NO: 1 or positions 216 or 349 of PDH E2 protein as 
1 5 defined by the position in SEQ ID NO: 2; and determining the status of the human by 
reference to polymorphism in PDH E2. 
Preferred polymorphisms are any one of the following: 



Position 


Polymorphism 


Reference 


857 


T/C 


SEQ ID NO: 1 


1255 


G/A 


SEQ ID NO: 1 


216 


Val-Ala 


SEQ ID NO: 2 


349 


Asp-Asn 


SEQ ID NO: 2 



According to one aspect of the present invention there is provided a method for the 
diagnosis of a polymorphism in a PDH E2 gene in a human, which method comprises 
20 determining the sequence of the nucleic acid of the human at one or more of positions 857 and 
1255 in the PDH E2 gene as defined by the positions in SEQ ID NO: 1; and determining the 
status of the human by reference to polymorphism in the PDH E2 gene. 

The term human includes both a human having or suspected of having a PDH- 
mediated disease and an asymptomatic human who may be tested for predisposition or 
25 susceptibility to such disease. At each position the human may be homozygous for an allele 
or the human may be a heterozygote. 
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The term TDH-mediated disease' means any disease in which changing the level of 
PDH or changing the activity of PDH would be of therapeutic benefit. 

The term TDH drug' means any drug which changes the level of PDH or changes the 
activity of PDH. A drug which increases the activity of PDH is preferred. 
5 The term polymorphism includes single nucleotide substitution, nucleotide insertion 

and nucleotide deletion, which in the case of insertion and deletion includes insertion or 
deletion of one or more nucleotides at a position of a gene and variable numbers of a repeated 
DNA sequence. 

In one embodiment of the invention preferably the method for diagnosis described 
1 0 herein is one in which the single nucleotide polymorphism at position 857 in the PDH E2 
gene as defined by the positions in SEQ ID NO: 1 is presence of T and/or C. 

In another embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism at position 1255 in the PDH E2 
gene as defined by the positions in SEQ ID NO: 1 is presence of G and/or A. 
1 5 The method for diagnosis is preferably one in which the sequence is determined by a 

method selected firom amplification refractory mutation system and restriction fi-agment 
length polymorphism. 

In another aspect of the invention we provide a method for the diagnosis of PDH- 
mediated disease, which method comprises: 
20 i) obtaining sample nucleic acid fi*om an individual, 

ii) detecting the presence or absence of a variant nucleotide at one or more of positions 857 
and 1255 in the PDH E2 gene as defined by the positions in SEQ ID NO: 1; and 

iii) determining the status of the individual by reference to polymorphism in the PDH E2 
gene. 

25 Allelic variation at position 857 in the PDH E2 gene as defined by the positions in 

SEQ ID NO: 1 consists of a single base substitution fi-om T (the published base), preferably to 
C. Allelic variation at position 1255 in the PDH E2 gene as defined by the positions in SEQ 
ID NO: 1 consists of a single base substitution fi-om G (the published base), preferably to A. 
The status of the individual may be determined by reference to allelic variation at any one or 

30 both positions optionally in combination with any other polymorphism in the gene that is (or 
becomes) known. 
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The test sample of nucleic acid is conveniently present in a sample of blood, sputum, 
skin, bronchoalveolar lavage fluid, or other body fluid or tissue obtained from an individual. 
It will be appreciated that the test sample may equally comprise a nucleic acid sequence 
corresponding to the sequence in the test sample, that is to say that all or a part of the region 
in the sample nucleic acid may firstly be amplified using any convenient technique e.g. PGR, 
before analysis of allelic variation. 

It will be apparent to the person skilled in the art that there are a large number of 
analytical procedures which may be used to detect the presence or absence of variant 
nucleotides at one or more polymorphic positions of the invention. In general, the detection 
of allelic variation requires a mutation discrimination technique, optionally an amplification 
reaction and optionally a signal generation system. Table 1 lists a number of mutation 
detection techniques, some based on PGR. These may be used in combination with a number 
of signal generation systems, a selection of which is listed in Table 2. Fiuther amplification 
techniques are listed in Table 3. Many ciurent methods for the detection of allelic variation 
are reviewed by NoUau et aL, Glin. Ghem. 42, 1 1 14-1120, 1997; and in standard textbooks, 
for example "Laboratory Protocols for Mutation Detection", Ed. by U. Landegren, Oxford 
University Press, 1996 and "PGR", 2"^ Edition by Newton & Graham, BIOS Scientific 
Publishers Limited, 1997. 





• • 


Abbreviations: 
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ALEX™ 


Amplification refractory mutation system linear extension 


APEX 


Arrayed primer extension 


ARMS™ 


Amplification refractory mutation system 


b-DNA 


Branched DNA 


CMC 


Chemical mismatch cleavage 


bp 


base pair 


COPS 


Competitive oligonucleotide priming system 


DGGE 


Denaturing gradient gel electrophoresis 


FRET 


Fluorescence resonance energy transfer 


IDDM 


Insulin-dependent diabetes mellitus 


LCR 


Ligase chain reaction 


MASDA 


Muhiple allele specific diagnostic assay 


NASBA 


Nucleic acid sequence based amplification 


NIDDM 


non-insulin dependent diabetes mellitus 


OLA 


Oligonucleotide ligation assay 


PGR 


Polymerase chain reaction 


PDH 


Pyruvate Dehydrogenase 


PDK 


Pyruvate Dehydrogenase Kinase 


PDK2 


Pyruvate Dehydrogenase Kinase Isoenzyme 2 


PTT 


Protein truncation test 


RFLP 


Restriction fragment length polymorphism 


SDA 


Strand displacement amplification 


SNP 


Single nucleotide polymorphism 


SSCP 


Single-strand conformation polymorphism analysis 


SSR 


Self sustained replication 


TGGE 


Temperature gradient gel electrophoresis 


3'UTR 


Y Untranslated Region 
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Table 1 - Mutation Detection Techniques 

General: DNA sequencing, Sequencing by hybridisation 

Scanning: PTT*, SSCP, DGGE, TGGE, Cleavase, Heteroduplex analysis, CMC, Enzymatic 
mismatch cleavage 
5 * Note: not useful for detection of promoter polymorphisms. 
Hybridisation Based: 

Solid phase hybridisation: Dot blots, MASDA, Reverse dot blots. Oligonucleotide arrays 
(DNA Chips). 

Solution phase hybridisation: Taqman™ . US-5210015 & US-5487972 (Hoffinann-La 
1 0 Roche), Molecular Beacons - Tyagi et al (1996), Nature Biotechnology, 14, 303; WO 
95/13399 (Public Health Inst., New York). 

Extension Based: ARMS™, ALEX^m . European Patent No. EP 332435 Bl (Zeneca 
Limited), COPS - Gibbs et al (1989), Nucleic Acids Research, 17, 2347. 
Incorporation Based: Mini-sequencing, APEX 
1 5 Restriction Enzyme Based: RFLP, Restriction site generating PCR 
Ligation Based: OLA 
Other: Invader assay 

Table 2 - Signal Generation or Detection Systems 
20 Fluorescence: FRET, Fluorescence quenching. Fluorescence polarisation - United Kingdom 
Patent No. 2228998 (Zeneca Limited) 

Other: Chemiluminescence, Electrochemiluminescence, Raman, Radioactivity, Colorimetric, 
Hybridisation protection assay, Mass spectrometry. 

25 Table 3 - Fu rther Amplification Methods 
SSR, NASBA, LCR, SDA, b-DNA 

Table 4- Protein va riation detection methods 
Immunoassay 
30 Immunohistology 
Peptide sequencing 
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Preferred mutation detection techniques include ARMS™, ALEX™, COPS, Taqman, 
Molecular Beacons, RFLP, and restriction site based PGR and FRET techniques. 
Immunoassay techniques are known in the art e.g. A Practical Guide to ELISA by D M 
Kemeny, Pergamon Press 1991; Principles and Practice of Immunoassay, 2""* edition, G P 
5 Price & D J Newman, 1997, published by Stockton Press in USA & Ganada and by 
Macmillan Reference in the United Kingdom. Histological techniques are described in 
Theory and Practice of Histological Techniques by J D Bancroft & A Stevens, 4*** Edition, 
Ghurchill Livingstone, 1996. Protein sequencing is described in Laboratory Techniques in 
Biochemistry and Molecular Biology, Volimie 9, Sequencing of Proteins and Peptides, G 
1 0 Allen, 2"*^ revised edition, Elsevier, 1 989. 

Particularly preferred methods include ARMS™ and RFLP based methods. ARMS™ 
is an especially preferred method. 

In a further aspect, the diagnostic methods of the invention are used to assess the 
efficacy of therapeutic compounds in the treatment of PDH-mediated diseases such as 
1 5 diabetes, asthma, obesity, sepsis, and peripheral vascular disease. 

Assays, for example reporter-based assays, may be devised to detect whether one or 
more of the above polymorphisms affect transcription levels and/or message stability. 

Individuals who carry particular allelic variants of the PDH E2 gene may therefore 
exhibit differences in their ability to regulate protein biosynthesis under different 
20 physiological conditions and may display altered abilities to react to different diseases. In 
addition, differences in protein regulation arising as a result of allelic variation may have a 
direct effect on the response of an individual to drug therapy. The diagnostic methods of the 
invention may be useftil both to predict the clinical response to such agents and to determine 
therapeutic dose. 

25 In a further aspect, the diagnostic methods of the invention, are used to assess the 

predisposition of an individual to diseases mediated by PDH. This may be particularly 
relevant in the development of diabetes, asthma, obesity, sepsis, and peripheral vascular 
disease and other diseases which are mediated by PDH. The present invention may be used to 
recognise individuals who are particularly at risk from developing these conditions. 

30 Low frequency polymorphisms may be particularly useful for haplotyping as 

described below. A haplotype is a set of alleles found at linked polymorphic sites (such as 
within a gene) on a single (paternal or maternal) chromosome. If recombination within the 
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gene is random, there may be as many as 2" haplotypes, where 2 is the number of alleles at 
each polymorphic position and n is the number of polymorphic positions. One approach to 
identifying mutations or polymorphisms which are correlated with clinical response is to carry 
out an association study using all the haplotypes that can be identified in the population of 
5 interest. The frequency of each haplotype is limited by the frequency of its rarest allele, so 
that polymorphisms with low frequency alleles are particularly useful as markers of low 
frequency haplotypes. As particular mutations or polymorphisms associated with certain 
clinical features, such as adverse or abnormal events, are likely to be of low frequency within 
the population, low frequency polymorphisms may be particularly useful in identifying these 
1 0 mutations (for examples see: De Stefano V et al, Ann Hum Genet (1998) ^:481-90; and. 
Keightley AM et al.. Blood (1999) 23:4277-83). 

In a further aspect, the diagnostic methods of the invention are used in the 
development of new drug therapies which selectively target one or more allelic variants of the 
PDH E2 gene. Identification of a link between a particular allelic variant and predisposition 
15 to disease development or response to drug therapy may have a significant impact on the 
design of new drugs. Drugs may be designed to regulate the biological activity of variants 
implicated in the disease process whilst minimising effects on other variants. 

In a further diagnostic aspect of the invention the presence or absence of variant 
nucleotides is detected by reference to the loss or gain of, optionally engineered, sites 
20 recognised by restriction enzymes. 

According to another aspect of the present invention there is provided a nucleic acid 
comprising any one of the following polymorphisms: 

the nucleic acid of SEQ ID NO: 1 with C at position 857 as defined by the positions in SEQ 
ID NO: 1; 

25 the nucleic acid of SEQ ID NO: 1 with A at position 1255 as defined by the position in SEQ 
ID NO: 1; 

or a complementary strand thereof or an antisense sequence thereto or a fragment thereof of at 
least 20 bases comprising at least one polymorphism. 

Fragments are at least 17 bases, more preferably at least 20 bases, more preferably at 
30 least 30 bases. 
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The scope of the invention does not extend to any nucleic acid as it is found in nature. 
A nucleic acid of the invention is preferably in isolated form, for example through being at 
least partially purified from any substance with which it occurs naturally (if any). 

Novel sequence disclosed herein, may be used in another embodiment of the 
5 invention to regulate expression of the gene in cells by the use of antisense constructs. To 
enable methods of down-regulating expression of the gene of the present invention in 
mammalian cells, an example antisense expression construct can be readily constructed for 
instance using the pREPlO vector (Invitrogen Corporation). Transcripts are expected to 
inhibit translation of the gene in cells transfected with this type of construct. Antisense 

1 0 transcripts are effective for inhibiting translation of the native gene transcript, and capable of 
inducing the effects (e.g., regulation of tissue physiology) herein described. Oligonucleotides 
which are complementary to and hybridisable with any portion of novel gene mRNA 
disclosed herein are contemplated for therapeutic use. U.S. Patent No. 5,639,595, 
"Identification of Novel Drugs and Reagents", issued Jun. 17, 1997, wherein methods of 

1 5 identifying oligonucleotide sequences that display in vivo activity are thoroughly described, 
is herein incorporated by reference. Expression vectors containing random oligonucleotide 
sequences derived firom previously known polynucleotides are transformed into cells. The 
cells are then assayed for a phenotype resulting from the desired activity of the 
oligonucleotide. Once cells with the desired phenotype have been identified, the sequence of 

20 the oligonucleotide having the desired activity can be identified. Identification may be 

accomplished by recovering the vector or by polymerase chain reaction (PGR) amplification 
and sequencing the region containing the inserted nucleic acid material. Antisense molecules 
can be synthesised for antisense therapy. These antisense molecules may be DNA, stable 
derivatives of DNA such as phosphorothioates or methylphosphonates, RNA, stable 

25 derivatives of RNA such as 2'-0-alkyIRNA, or other oligonucleotide mimetics. U.S. Patent 
No. 5,652,355, "Hybrid Oligonucleotide Phosphorothioates", issued July 29, 1997, and U.S. 
Patent No. 5,652,356, "Inverted Chimeric and Hybrid Oligonucleotides", issued July 29, 
1997, which describe the synthesis and effect of physiologically-stable antisense molecules, 
are incorporated by reference. Antisense molecules may be introduced into cells by 

30 microinjection, liposome encapsulation or by expression from vectors harboring the antisense 
sequence. 
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The invention further provides nucleotide primers which can detect the 
polymorphisms of the invention. 

According to another aspect of the present invention there is provided an allele 
specific primer capable of detecting a PDH E2 gene polymorphism at one or more of positions 
5 857 and 1255 in the PDH E2 gene as defined by the positions in SEQ ID NO: 1. 

An allele specific primer is used, generally together with a constant primer, in an 
amplification reaction such as a PGR reaction, which provides the discrimination between 
alleles through selective amplification of one allele at a particular sequence position e.g. as 
used for ARMS™ assays. The allele specific primer is preferably 17- 50 nucleotides, more 
10 preferably about 17-35 nucleotides, more preferably about 17-30 nucleotides. 

An allele specific primer preferably corresponds exactly with the allele to be detected 
but derivatives thereof are also contemplated wherein about 6-8 of the nucleotides at the 3' 
terminus correspond with the allele to be detected and wherein up to 10, such as up to 8, 6, 4, 
2, or 1 of the remaining nucleotides may be varied without significantly affecting the 
1 5 properties of the primer. 

Primers may be manufactured using any convenient method of synthesis. Examples of 
such methods may be found in standard textbooks, for example "Protocols for 
Oligonucleotides and Analogues; Synthesis and Properties," Methods in Molecular Biology 
Series; Volume 20; Ed. Sudhir Agrawal, Humana ISBN: 0-89603-247-7; 1993; V Edition. If 
20 required the primer(s) may be labelled to facilitate detection. 

According to another aspect of the present invention there is provided an allele- 
specific oligonucleotide probe capable of detecting a PDH E2 gene polymorphism at one or 
more of positions 857 and 1255 in the PDH E2 gene as defined by the positions in SEQ ID 
NO: 1. 

25 The allele-specific oligonucleotide probe is preferably 17- 50 nucleotides, more 

preferably about 17-35 nucleotides, more preferably about 17-30 nucleotides. 

The design of such probes will be apparent to the molecular biologist of ordinary skill. 
Such probes are of any convenient length such as up to 50 bases, up to 40 bases, more 
conveniently up to 30 bases in length, such as for example 8-25 or 8-15 bases in length. In 

30 general such probes will comprise base sequences entirely complementary to the 

corresponding wild type or variant locus in the gene. However, if required one or more 
mismatches may be introduced, provided that the discriminatory power of the oligonucleotide 
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probe is not unduly affected. The probes of the invention may carry one or more labels to 
faciUtate detection. 

According to another aspect of the present invention there is provided a diagnostic kit 
comprising an allele specific oligonucleotide probe of the invention and/or an allele-specific 
5 primer of the invention. 

The diagnostic kits may comprise appropriate packaging and instructions for use in the 
methods of the invention. Such kits may further comprise appropriate buffer(s), nucleotides, 
and polymerase(s) such as thermostable polymerases, for example taq polymerase. 



10 a genetic markers in a linkage study. This particularly applies to the polymorphism at 

position 1255 in the PDH E2 gene as defined by the positions in SEQ ID NO: 1 because of its 
relatively high frequency (see Examples below). 

According to another aspect of the present invention there is provided a method of 
treating a human in need of treatment with a PDH drug in which the method comprises: 

15 i) diagnosis of a polymorphism in PDH E2 in a human, which method comprises 

determining the sequence of the human at one or more of positions 857 and 1255 in the PDH 
E2 gene as defined by the positions in SEQ ID NO: 1 or positions 216 or 349 of PDH E2 
protein as defined by the position in SEQ ID NO: 2; and determining the status of the human 
by reference to polymorphism in the PDH E2 gene; and 

20 ii) administering an effective amount of a PDH drug. 

According to another aspect of the present invention there is provided a method of 
treating a human in need of treatment with a PDH drug in which the method comprises: 

i) diagnosis of a polymorphism in the PDH E2 gene in the human, which diagnosis 
comprises determining the sequence of the nucleic acid at one or more of positions 857 and 

25 1255 in the PDH E2 gene as defined by the positions in SEQ ID NO: 1, and determining the 
status of the human by reference to polymorphism in the PDH E2 gene; and 

ii) administering an effective amount of a PDH drug. 

Preferably determination of the status of the human is clinically useful. Examples of 
clinical usefulness include deciding which drug or drugs to administer and/or establishing the 
30 effective amount of the drug or drugs. 

Drugs which increase the activity of PDH are of value in a number of disease 
conditions, including disease states associated with disorders of glucose utilisation such as 



In another aspect of the invention, the polymorphisms of this invention may be used as 
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diabetes and obesity, and associated with excessive production of lactate such as encountered 
in sepsis and other causes of lactic acidaemia. Additionally drugs which increase the activity 
of PDH may be expected to have utility in diseases where supply of energy-rich substrates to 
tissues is limiting such as peripheral vascular disease, coronary failure and certain cardiac 
5 myopathies, muscle ataxia and weakness. 

According to another aspect of the present invention there is provided use of a PDH 
drug in the preparation of a medicament for treating a PDH-mediated disease in a human 
diagnosed as having a polymorphism at one or more of positions 857 and 1255 in the PDH E2 
gene as defined by the positions in SEQ ID NO: 1. 

1 0 According to another aspect of the present invention there is provided use of a PDH 

drug in the preparation of a medicament for treating a PDH-mediated disease in a human 
diagnosed as having a a polymorphism in PDH E2 in a himian, which method comprises 
determining the sequence of the human at one or more of positions 857 and 1255 in the PDH 
E2 gene as defined by the positions in SEQ ID NO: 1 or positions 216 or 349 of PDH E2 

1 5 protein as defined by the position in SEQ ID NO: 2. 

According to another aspect of the present invention there is provided a 
pharmaceutical pack comprising a PDH drug and instructions for administration of the drug to 
humans diagnostically tested for a polymorphism at one or more of positions 857 and 1255 in 
the PDH E2 gene as defined by the positions in SEQ ID NO: 1 . 

20 According to another aspect of the present invention there is provided a 

pharmaceutical pack comprising a PDH drug and instructions for administration of the drug to 
humans diagnostically tested for a polymorphism in PDH E2 in a human, which method 
comprises determining the sequence of the hxmian at one or more of positions 857 and 1255 in 
the PDH E2 gene as defined by the positions in SEQ ID NO: 1 or positions 216 or 349 of 

25 PDH E2 protein as defined by the position in SEQ ID NO: 2. 

According to another aspect of the present invention there is provided a computer 
readable medium comprising at least one novel polynucleotide sequence of the invention 
stored on the medivun. The computer readable medium may be used, for example, in 
homology searching, mapping, haplotyping, genotyping or pharmacogenetic analysis or any 

30 other bioinformatic analysis. The reader is referred to Bioinformatics, A practical guide to the 
analysis of genes and proteins, Edited by A D Baxevanis & B F F Ouellette, John Wiley & 
Sons, 1998. Any computer readable mediiun may be used, for example, compact disk, tape. 
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floppy disk, hard drive or computer chips. 

The polynucleotide sequences of the invention, or parts thereof, particularly those 
relating to and identifying the polymorphisms identified herein represent a valuable 
information source, for example, to characterise individuals in terms of haplotype and other 
5 sub-groupings, such as investigation of susceptibility to treatment with particular drugs. 
These approaches are most easily facilitated by storing the sequence information in a 
computer readable medium and then using the information in standard bioinformatics 
programs or to search sequence databases using state of the art searching tools such as 
"GCC". Thus, the polynucleotide sequences of the invention are particularly useful as 

1 0 components in databases useful for sequence identity and other search analyses. As used 
herein, storage of the sequence information in a computer readable medium and use in 
sequence databases in relation to ^polynucleotide or polynucleotide sequence of the invention' 
covers any detectable chemical or physical characteristic of a polynucleotide of the invention 
that may be reduced to, converted into or stored in a tangible medium, such as a computer 

1 5 disk, preferably in a computer readable form. For example, chromatographic scan data or 
peak data, photographic scan or peak data, mass spectrographic data, sequence gel (or other) 
data. 

The invention provides a computer readable medium having stored thereon one or 

< 

more polynucleotide sequences of the invention. For example, a computer readable medium 
20 is provided comprising and having stored thereon a member selected from the group 

consisting of: a polynucleotide comprising the sequence of a polynucleotide of the invention, 
a polynucleotide consisting of a polynucleotide of the invention, a polynucleotide which 
comprises part of a polynucleotide of the invention, which part includes at least one of the 
polymorphisms of the invention, a set of polynucleotide sequences wherein the set includes at 
25 least one polynucleotide sequence of the invention, a data set comprising or consisting of a 
polynucleotide sequence of the invention or a part thereof comprising at least one of the 
polymorphisms identified herein. 

A computer based method is also provided for performing sequence identification, 
said method comprising the steps of providing a polynucleotide sequence comprising a 
30 polymorphism of the invention in a computer readable medium; and comparing said 

polymorphism containing polynucleotide sequence to at least one other polynucleotide or 
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polypeptide sequence to identify identity (homology), i.e. screen for the presence of a 
polymorphism. 

Another aspect of the invention comprises use of any polymorphism as defined herein 
in bioinformatic analysis. Preferably the bioinformatic analysis is selected fi-om homology 
5 searching, mapping, haplotyping, genotyping or pharmacogenetic analysis. 

According to another aspect of the present invention there is provided an allelic variant 
of the human PDH E2 polypeptide having a alanine at position 216 and/or an asparagine at 
position 349 or a firagment thereof comprising at least 10 amino acids provided that the 
fi*agment comprises the allelic variant at position 216 and/or position 349, 
10 Fragments of PDH E2 polypeptide are at least 10 amino acids, more preferably at least 

15 amino acids, more preferably at least 20 amino acids. The polypeptide of the invention 
does not encompass naturally occuring polypeptide as it occurs in nature, for example, the 
polypeptide is at least partially purified firom at least one component with which it occurs 
naturally. Preferably the polypeptide is at least 30% pure, more preferably at least 60% pure, 
1 5 more preferably at least 90% pure, more preferably at least 95% pure, and more preferably at 
least 99% pure. 

According to another aspect of the present invention there is provided an antibody 
specific for an allelic variant of human PDH E2 polypeptide having a alanine at position 216 
and/or an asparagine at position 349 or a firagment thereof comprising at least 10 amino acids 

20 provided that the fragment comprises the allelic variants at position 216 and /or position 349. 
Antibodies can be prepared using any suitable method. For example, purified 
polypeptide may be utilised to prepare specific antibodies. The term "antibodies" is meant to 
include polyclonal antibodies, monoclonal antibodies, and the various types of antibody 
constructs such as for example F(ab')2, Fab and single chain Fv. Antibodies are defined to be 

25 specifically binding if they bind the T679M variant of integrin a4 with a of greater than or 
equal to about 10^ M*. Affinity of binding can be determined using conventional techniques, 
for example those described by Scatchard et aL, Ann, KY, Acad, ScL, 51:660 (1949). 

Polyclonal antibodies can be readily generated from a variety of sources, for example, 
horses, cows, goats, sheep, dogs, chickens, rabbits, mice or rats, using procedures that are 

30 well-known in the art. In general, antigen is administered to the host animal typically through 
parenteral injection. The immunogenicity of antigen may be enhanced through the use of an 
adjuvant, for example, Freund's complete or incomplete adjuvant. Following booster 
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immunisations, small samples of serum are collected and tested for reactivity to antigen. 
Examples of various assays useful for such determination include those described in: 
Antibodies: A Laboratory Manual^ Harlow and Lane (eds,). Cold Spring Harbor Laboratory 
Press, 1988; as well as procedures such as countercurrent immuno-electrophoresis (CIEP), 
5 radioimmunoassay, radioimmunoprecipitation, enzyme-linked immuno-sorbent assays 

(ELISA), dot blot assays, and sandwich assays, see U.S. Patent Nos. 4,376,1 10 and 4,486,530. 

Monoclonal antibodies may be readily prepared using well-known procedures, see for 
example, the procedures described in U.S. Patent Nos. RE 32,01 1, 4,902,614, 4,543,439 and 
4,411,993; Monoclonal Antibodies, Hybridomas: A New Dimension in Biological Analyses, 

1 0 Plenum Press, Kennett, McKeam, and Bechtol (eds.), (1980). 

The monoclonal antibodies of the invention can be produced using alternative 
techniques, such as those described by Alting-Mees et al., "Monoclonal Antibody Expression 
Libraries: A Rapid Alternative to Hybridomas", Strategies in Molecular Biology 1: 1-9 (1990) 
which is incorporated herein by reference. Similarly, binding partners can be constructed 

1 5 using recombinant DNA techniques to incorporate the variable regions of a gene that encodes 
a specific binding antibody. Such a technique is described in Larrick et al.. Biotechnology, 2: 
394 (1989). 

Once isolated and purified, the antibodies may be used to detect the presence of 
antigen in a sample using established assay protocols. 
20 The invention will now be illustrated but not limited by reference to the following 

Examples. All temperatures are in degrees Celsius. 

In the Examples below, unless otherwise stated, the following methodology and 
materials have been applied. 

AMPLITAQ'^M or AMPLITAQ GOLD™ available firom Perkin-Ehner Cetus, are used 
25 as the source of thermostable DNA polymerase. 

General molecular biology procedures can be followed fi-om any of the methods 
described in "Molecular Cloning - A Laboratory Manual" Second Edition, Sambrook, Fritsch 
and Maniatis (Cold Spring Harbor Laboratory, 1989) or "Current Protocols in Molecular 
Biology", Volumes 1-3 , Edited by F M Asubel, R Brent & R E Kingston, published by John 
30 Wiley, 1998, 
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Electropherograms were obtained in a standard manner: data was collected by ABI377 
data collection software and the wave form generated by ABI Prism*^^ sequencing analysis 
(2.1.2). 
Example 1 
5 Identification of Polymorphisms 
1. Methods 
c-DNA Preparation 

RNA was prepared from lymphoblastoid cell lines from Caucasian donors using 
standard laboratory protocols (Chomczynski and Sacchi, Anal. Biochem, 162 . 156-159, 1987) 
1 0 and used to generate first strand cDNA (Gubler and Hofftnan, Gene 25, 263-269, 1983). 
Template Preparation 

Templates were prepared by PGR using the oligonucleotide primers and aimealing 
temperatures set out below. The extension temperature was 72° and denaturation temperature 
94°; each step was 1 minute. Generally 100 pg cDNA was used in each reaction and 



1 5 subjected to 40 cycles of PGR. 



Fragment 


Forward 
Oligo 


Reverse 
Oligo 


Annealing 
Temp 


DMSO 


MgCl, 


176-579 


176-198 


559-579 


55° 


5% 


1.5mM 


455-998 


455-475 


975-998 


60° 


0% 


1.5niM 


815-1206 


815-836 


1185-1206 


62° 


0% 


ImM 


1109-1592 


1109-1128 


1570-1592 


62° 


0% 


ImM 


1429-1893 


1429-1450 


1871-1893 


62° 


0% 


ImM 


All positions refer to the positions in SEQ ED N 


fO: 1. 



For dye-primer sequencing the forward primers were modified to include Ml 3 forward 
sequence (ABI protocol P/N 4021 14, Applied Biosystems) at the 5' end of the 
oligonucleotides. 



20 Dye Primer Sequencing 

Dye-primer sequencing using Ml 3 forward primer was as described in the ABI 
protocol P/N 4021 14 for the ABI Prism™ dye primer cycle sequencing core kit with 
"AmpliTaq FS"^'*^ DNA polymerase, modified in that the annealing temperature was 45° and 
DMSO was added to the cycle sequencing mix to a final concentration of 5%. 
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The extension reactions for each base were pooled, ethanol/sodium acetate 
precipitated, washed and resuspended in formamide loading buffer. 

4.25% Acrylamide gels were run on an automated sequencer (ABI 377, AppUed 
Biosystems). 



5 2. Results 

Novel Polymorphisms 



Position 


Published 
allele 


Variant 
allele 


amino acid 
change 


RFLP 


Allele 
Frequency 


857 


T 


C 


Val-Ala 


+ HhaI 


11/40 


1255 


G 


A 


Asp-Asn 


-Clal 


17/38 



All positions refer to the positions in SEQ ID NO: 1. 
Frequency is the allele firequency of the variant allele in control subjects. 
Summary of polymorphisms: 



10 Position 857 ofSEQ ID NO: 1, GIG (Val at 216 of SEQ ID NO: 2) to G£G (Ala at 216 of 
SEQ ID NO: 2); and 

Position 1255 of SEQ ID NO: 1 fiAT (Asp at 349 of SEQ ID NO: 2) to AAT (Asn at 349 of 
SEQ ID NO: 2). 
Example 2 

1 5 Analysis of the sequences of PDH E2 disclosed in public databases 

Two cDNA sequences encoding PDH E2 have been submitted to public databases 
under accession numbers Y00978 and J03866. Y00978 identifies an open reading frame 
(ORF) which extends previously published sequence at the 5' end but does not identify an 
ATG start site. J03866 contains more 5' sequence than Y00978 and identifies an ATG start 
20 site. However, the 5' sequence of J03866 does not match that of Y00978. 

We used sequence regions common to Y00978 and 103 866 to search public and 
proprietary sequence databases and were able to obtain extended sequence at the 5' end. The 
extended sequence identified the start of an open reading firame which was in frame with the 
ORFs reported within both Y00978 and J03866. The initiating ATG corresponded to position 
25 21 1 of Y00978, position 843 of J03866. 

Sequence at positions 1-698 of J03866 did not match the extended sequence 
determined in our analysis. Sequence J03866 1-698 was found to be identical to murine 
sulphatase A. Furthermore, we were unable to amplify the predicted PGR product sizes from 
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human lymphoblastoid cell line cDNA using PGR primers designed from sequence J03866 1- 
698, providing further confirmation that sequence JOS 866 1-698 is incorrect. J03866 also 
differs from Y00978 by three missing nucleotides. 

Sequence analysis of lymphoblastoid cell line cDNA agreed with the sequence of 
5 Y00978. In conclusion, we believe that sequence Y00978 is correct and that the ATG start 
site corresponds to position 211 of Y00978. 



